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ABSTRACT 

The  follovlng  conjecture  Is  considered: 

(*)  It  is  unsolvahle  whether  a  language  (■  context  free  language) 
contains  a  sequence. 

VThlle  this  conjecture  is  left  unresolved,  a  nunber  of  results  pertaining  to  it 
are  obtained.  For  exsaple,  the  unsolvablllty  of  whether  a  language  contains 
the  set  {abaS)a^. .  .ba’^/n  2  1},  iaplies  (*).  It  is  shown  that  (*)  is  equivalent 
to  the  ujuolvablllty  of  whether  a  language  contains  a  chain  of  a  speeiad  fom. 
Several  facts  about  whether  a  language  contains  a  specific  sequence  are  also 
demnstrated.  In  particular,  it  is  shown  that  whether  a  language  contains  a 
given  sequence  is  unsolvable,  but  whether  a  language  contains  a  given  ultimately 
periodic  sequence  is  solvable. 
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80MB  REMARKS  ABOUT  SBftUEBCBS  U 
CGHTEXT  FREB  LA1I0UA(^ 


Introduction 

In  [6]  it  vaa  shotm  by  a  complicated  argvnent  that  for  two  (context  free) 
languages  1^  and  1^,  it  is  recursively  unsolvable  whether  there  exists  a  coai> 
plate  sequential  nachlne  aappli^  1^  into  How  an  alternative  (and  quite 
sli9le)  proof  of  this  fact  would  follcw  fron  verification  of  the  following 
conjecture:  It  is  recursively  unsolvable  whether  a  language  contains  an 
ultiaately  periodic  sequence.  [For  the  language  {a!^/n  i  l}  can  be  napped  into 
an  arbitrary  language  L  by  a  cosqplete  sequential  nachlne  if  and  only  if  L  con¬ 
tains  an  ultlnately  periodic  sequence.]  Neither  this  conjecture  nor  its 
analogue  for  sequences  in  general  has  been  settled,  but  they  have  provided 
notivation  for  the  study  of  sequences  In  languages.  The  present  paper  sets 
forth  several  results  about  sequences  in  languages  and  shows  how  seme  of  the 
questions  which  renaln  unanswered  nay  be  reduced. 

The  paper  is  divided  into  four  sections.  Section  1  reviews  the  texvinology 
of  languages.  In  section  2  a  language  is  edilbited  which  contains  exactly  one 
sequence,  a  sequence  which  is  not  ultlnately  periodic.  Using  this  language  it 
is  shown  that  the  unsolvability  of  whether  a  language  contains  a  set  of  words 
of  a  certain  fona  im>lles  the  unsolvability  of  whether  a  language  eontadns  a 
sequence.  In  section  3  the  solvability  of  a  language  contadnlng  a  sequence  is 
proved  equivalent  to  the  solvability  of  a  language  containing  at  least  one' 
special  kind  of  chain.  In  section  4  it  is  shown  that  whether  a  language  con- 
tadns  a  given  sequence  is  unsolvable,  but  whether  a  lauoguage  contains  a  given 
ultlnately  periodic  sequence  is  solvable. 
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Section  1.  PrellBlnarlet 

Let  r  be  a  finite  noneapty  eet  and  let  6(E)  be  the  free  sealgroup  with 
identity  c  generated  by  E.  (Thue  6(E)  le  the  eet  of  all  finite  sequences,  or 
words,  of  E  and  c  is  the  eopty  sequence. )  Ve  shall  be  eonsidoring  certain  sub¬ 
sets  of  6(E)  which  are  called  "context  free  languages,"  or  "languages"  for 
short.  These  languages  curose  in  the  study  of  natural  languages  [2]  and  have 
been  shewn  to  be  identical  with  the  ccsqponents  la  the  "ALOOL-llke”  artiflcicd 
languages  \dilch  occur  in  data  processing  [U]. 

A  G  is  a  U-ivq>le  (V,P,E,S),  where  V  is  a  finite  set,  E  is  a  subset 

of  V,  S  is  an  elenent  of  V-E,  ani  P  is  a  finite  set  of  ordered  pairs  of  the 
fom  (%,v)  with  ;  in  V-E  and  w  in  6(v).  P  is  called  the  set  of  productions  of 
0.  An  element  (^,v)  in  P  is  denoted  by  $  -•  w.  If  x  and  y  are  in  6(V),  then 
we  write  x  •  y  if  either  x  ■  y  or  there  exists  a  sequence  x  ■  x^,  x^, . . . ,x^  * 
y  (n  >  1)  of  elements  in  6(V)  with  the  following  property.  For  each  i  <  n  there 
exist  bj^,  5^,  such  that  x^^  -  and  5^^  -  w^^.  The 

language  .generated  by  G,  denoted  by  L(g),  is  the  set  of  words  fw/S  *  w,  w  in 
6(E)'|.  a  context  free  language  (over  E)  is  a  language  L(0)  generated  by  seme 
gramar  G  ■  (V,P,E,S).  Uhless  otherwise  stated,  by  a  language  we  shall  always 
mean  a  context  free  language. 

If  A  and  B  are  subsets  of  d(E),  then  the  set  of  words  {ab/a  in  A,  b  in  B) 
is  called  the  product  of  A  and  B  and  is  written  AB.  If  A  (or  B)  consists  of 
Just  one  word,  say  A  -  fa}  (B  -  fb)),  then  aB  (Ab)  is  written  instead  of  AB. 
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For  each  word  x  ia  9(£)«  |x|  denotes  the  length  of  x. 

fl)  *(2) 

If  A  sad  B  are  languages,  then  so  are  AB,  A  -f  B,'  '  and  A  [3]* 

Ihe  faadly  of  regular  sets  is  characterised  as  the  smallest  fsadly  of  sub¬ 
sets  of  6(1;)  containing  the  finite  sets  and  closed  under  the  operations  of 
union,  product,  and  *  r9]>  ^ch  'egular  set  is  a  language  [3]. 

Let  x^, . ..,x^, ...  (vritten  x^. ..x^. ..)  be  an  infinite  sequence  of  elements 

of  £.  Ihe  sequence  is  sadd  to  be  contained  in  a  set  H  of  words  (or  H  contains 

the  sequence)  if  the  word  ..x^^  is  in  H  for  each  1.  A  sequence  Xj^...Xjj. ..  is 
said  to  be  ultimately  periodic  (u.p. )  if  there  exist  Integers  Hq  and  p  so  that 

■  x^  for  n  ae  Oq.  An  infinite  sequence  of  words  fw^^}  is  said  to  be  a  chain 

if  Wj  is  an  initial  subword  of  w^^^^  for  each  i,  l.e.,  w^^^^  -  w^u^  for  soeie  u^ 
in  B(£}-s.  A  language  is  said  to  contain  a  chain  if  the  language  contains  each 
word  in  the  chain. 

We  are  Interested  in  whether  a  language  containing  an  u.p.  sequence  is 
unsolvable  and  whether  a  language  contcdnlng  a  sequence  is  unsolvable;  and  we 
shall  present  a  number  of  results  which  have  arisen  during  a  study  of  these 
two  probleu. 


^^^Both  "+"  auid  are  used  to  denote 

(2)  # 

If  A  is  a  set  of  words,  then  A  ■  t 


set  union. 

m 

+  U  A®,  where  A^ 
1 


A  and  A^*^  -  A^A. 


31  January  1963 


-8- 


aM-738/OOl/OO 


Section  2.  Dlstlneulehed  Seguencaa 

Consider  the  question  of  whether  or  not  a  language  containing  a  sequence 
must  contain  an  u.p.  sequence.  By  application  of  a  systematic  procedure,  ve 
can  effectively  enumerate  those  languages  which  contain  no  sequences.  Since  we 
can  test  a  given  language  to  see  If  It  contains  a  specified  u.p.  sequence, 
and  since  ve  can  effectively  enumerate  the  u.p.  sequences,  ve  also  have  a 
systematic  procedure  for  effectively  enumerating  those  languages  which  contain 
u.p.  sequences,  therefore.  If  each  language  containing  a  sequence  contained  an 
u.p.  sequence,  ve  would  have  a  decision  procedure  for  deteimlnlng  whether  or 
not  a  given  language  contained  an  (u.p. )  sequence.  However,  ve  now  show  that 
there  are  languages  which  contain  a  sequence  but  no  u.p.  sequence. 

Notation.  Given  a  word  v  and  an  element  b  In  £,  let  #b(w)  be  the  number  of 
occurrences  of  b  in  w. 


Theorem  2.1.  There  exists  a  language  which  contains  a  sequence  but  no  u.p. 
sequence. 

Proof.  Let  where  Z  -  fa,bl,  -  5  U  L,  and  P^  -  f?  -  b, 

?  -  a5,  5  -  b?a).  Let  -  L(Gj^).  Clearly  -  fu/u  -  wba^^'^^,  w  In  e(a,b)). 

If  u  ■  wba“  Is  in  Lj^,  then  wba“ba°'*’^  is  in  Lj^  and  is  the  proper  extension  of  u 

2 

In  L^  of  smallest  length.  Hence  L^  contains  the  chain  Cab,  bba,  bbaba  , 
bbaba^ba^, .... 


'See  theorem  ^.2. 
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Let  Gg  -  (VgjPg,!:,?)*  wl»®re  Vg-S  -  C5,tf,Y)  and  Pg  -  (5  -  vy,  y  -  ba, 

Y  -•  aiy,  Y  ••  Y  »  *•  'b»  v  •*  aw,  w  -•  bw,  w  -•  wa}.  Let  1^  -  L(Gg). 

Oien 

■  fu/u  ■  wba®,  1  «  n  <  ji*b(v),  w  In  8(a,b)  b6(a,b)). 

Let  L^  >  b  +'  L^b  +  Mote  that  each  word  in  b  -f  L^b  ends  in  b,  and  each 
word  In  ends  in  a.  Let  D  be  the  sequence  bbaba^ba^. . .  .  Obviously  D  is 
not  u.p.  Ve  shall  show  that  L^  contains  D  but  no  other  sequence. 

Bach  vord  in  D  ending  in  b  is  in  b  -f  L^b.  Since  each  vord  in  D  ending  in 
a  is  in  Ig,  D  is  contained  in  L^.  Now  let  E  be  any  sequence  contained  in  L^. 
Neither  a  nor  ba  is  in  Lg,  thus  neither  is  in  L^.  Therefore  E  begins  vith  bb. 
Now  8vq)pose  that  E  begins  vith  buba*^,  n  2  0,  for  sone  vord  u  in  6(a,b).  Two 
cases  arise. 

(or)  n  a  #b(bu).  Then  buba°'*’^  is  not  in  Lg,  and  thus  not  in  L^.  Hence 
E  must  begin  with  buba%. 

(S)  n  <  ijlb(bu).  Since  n  ^  #b(bu),  buba’’*  is  not  in  L^.  Thus  E  cannot 
begin  vith  buba%,  that  is,  E  begins  vith  buba^'*’^. 

By  induction  it  therefore  follows  that  E  ■  D.  Hence  D  is  the  only 
sequence  contained  in  L^. 

Using  the  languages  constructed  in  Theorem  2.1  we  now  show  that  if  either 
of  two  problems  is  recursively  unsolvable,  then  so  is  the  problem  of  whether 
a  given  language  contains  a  sequence. 
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Let  Dj  1^4  and  be  as  In  nieoreB  2.1  and  let  6  ■  6(a,b).  For  M  and 

R  Bubsets  of  e,  let  -  b  +  L,b  +  U  (Hb)“‘*’H!ba“''’^.  It  la  readily  seen 

^  n-0 

that  t(M,R)  1b  a  language  If  M  and  N  are.  Purthennore,  c  if 

c  M  and  c  N.  Then 

t(9,0)  -  b  +  L.b  ♦  U  (eb)“'*^a“”*’^ 

^  n-0 

-  b  +  Lj^b  + 


L 


3* 


Since  D  1b  the  only  aequence  in  L^,  it  foUovB  that  contains  a  sequence 

if  and  only  if  it  contains  the  sequence  D. 


w 

Consider  T(M,a*)  -  b  +  L,b  +U  (a*b)*^"*’'*Mba®'*’^.  If  this  set  is  to  contain 

0 

D  then  it  is  necessary  that  M  (i)  contain  each  of  the  voids  e,  ba,  baba  , 
babaS>a^,...  (to  obtain  members  in  the  sequence  of  the  form  vba);  and  (ii)  con> 
tain  each  word  of  the  form  .  .ba**^,  n  a  0,  m  a  1  (to  obtain  members 

in  the  sequence  of  the  form  wba*^"^,  with  m  -  #b(w)-n-l).  Moreover,  if  M 
satisfies  (1)  and  (il),  then  it  is  readily  seen  that  D  c  T(M,a  }  since 
D  n  0b  c  b  +  Lj^b. 


Sv^pose  that  M  «  bP  +  a0  c.  Then  (ii)  is  satisfied;  and  (i)  is  satisfied 
if  and  only  if  P  contains  the  set  faba  . ..ba  /n  a  l).  nius  T(M,a  )  contains  a 
sequence  if  and  only  if  P  contains  (aba^. .  .ba’^/n  a  l}.  Therefore  we  get 

Theorem  2.2.  If  whether  a  language  contains  (aba^. .  .ba’^/n  a  1}  is  unsolvable, 
then  whether  a  language  contains  a  sequence  is  unsolvable. 
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Suppoae  Uiat  M  ■  P  a  4  b9.  Then  (i)  le  estlsfled,  an&  (ii)  le  satisfied 
for  B  ■  1.  Nov  (li)  holds  for  every  1  If  and  only  if  P  contains  each  vord  of 
the  form  aW^..ba^^  1,  k  a  1.  Thus  ve  have 

Theorem  2.3»  If  whether  a  language  contains  faSa^*''^. . .  k  2  ll  is 

unsolvable,  then  whether  a  language  contains  a  sequence  is  unsolvable. 

We  now  consider  sequences  D  with  the  proi>erty  that  there  is  a  language 
containing  D  and  no  other  sequence. 

Definition.  A  sequence  D  of  words  with  the  above  property  is  called  a 
distinguished  sequence. 

Since  every  u.p.  sequence  is  a  language,  each  u.p.  sequence  is  distin¬ 
guished.  The  sequence  D  in  Theorem  2.1  shows  that  the  converse  is  not  true, 
i.e.,  there  are  distinguished  sequences  which  are  not  u.p. 

Given  a  distinguished  sequence  D  ve  may  obtain  other  distinguished 

(h) 

sequences  as  follows.  Let  S  be  any  cos^lete  sequential  machine^  '  with  the 

(U) 

'  'k  generalized  sequential  machine  S  is  a  6-tuple  (K,i:,A,6,X,Pj^)  where  (i)  K 
is  a  finite  nonempty  set  (of  "states");  (ll)  S  is  a  finite  nonempty  set  (of 
"inputs");  (iii)  d  is  a  finite  nonempty  set  (of  "outputs");  (iv)  6  is  a  mapping 
of  K  X  C  into  K  (the  "next  state"  function);  (v)  X  is  a  mapping  of  K  X  £  into 
6(d)  (the  "output"  function);  and  (vl)  p^  is  an  element  of  K  (the  "start" 
state).  A  complete  sequential  machine  is  a  generalized  sequential  machine  in 
which  X  nape  K  X  £  into  d. 
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property  that  at  each  state,  X  naps  S  one  to  one  into  t.  Let  L  be  a  language 
containing  a  distinguished  sequence  D.  nxen  S(L)^^^  is  a  language  [$]  con¬ 
taining  the  sequence  S(d).  That  S(l)  contsdns  no  sequence  but  S(D)  follows 
froB  X  napping  S  one  to  one  into  L.  Furthensore,  if  D  is  not  u.p. ,  mither  is 
S(D).  Ve  onlt  the  straightforward  details. 

Die  question  natureilly  arises:  Are  there  any  sequences  which  are  not 
distinguished?  A  sisqtle  cardinality  argument  shows  there  are.  For  there  are 
2  sequences  when  S  contains  at  least  two  elements,  and  only  languages, 
mius  there  exists  a  sequence  D  (in  fact  2  )  such  that  any  language  containing 

D  contains  at  least  one  other  sequence,  i.e.,  a  sequence  D  which  is  not  dis¬ 
tinguished.  More  precisely. 

Theorem  2.U.  Every  distinguished  sequence  is  recursive. 

Proof.  Let  L  c  6(aj^, . . .  ,aj^)  be  a  given  language.  We  outline  am  effective 
procedure <P  with  the  following  property:  If  L  contains  at  least  one  sequence 
(w,  1  -  and  every  sequence  in  L  contains  w_,  then  P  selects  w  at  the  n-th 

stage. 

Each  stage  of  P  is  divided  into  substages.  In  substage  B(a  1}  of  stage  1, 
P  detexnlnes  the  finite  set 

^Extend  6  and  X  to  K  x  9(t}  as  follows.  Let  6(q,c)  -  q  and  X(q,c)  ■  s.  For 
each  word  x^...Xj^^^,  each  x^  in  E,  let  6(q,Xj^. . -  6[6(q,Xj. . .Xj^), 

••eh  word  w,  let 

S(w)  -  X(p_j^,w).  For  each  set  L,  let  S(L)  -  fs(w)/w  in  L). 
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D,  ^  ■  fw/  |wl  •  ■  and  Inlt  l}. 

If  _  l8  eopty,  P  selects  a.  and  proceeds  to  stage  2.  If  .  is  noneopty 

XjB  JL  XpB 

and  each  vord  la  it  begins  vlth  the  sane  letter,  say  b^,  P  selects  b^  suid  pro¬ 
ceeds  to  stage  2.  Othenrlse  P  proceeds  to  substage  a  +  1. 

If  L  contains  at  least  one  sequence  and  every  sequence  In  L  contains  b^, 
then  there  oust  be  sane  m  k  1  such  that  _  Is  nonennpty  cmd  contains  only  words 

XpB 

beginning  vlth  b^.  (nils  follows  Inmedlately  fron  the  vell-knoim  Infinity 
Lenna  of  graph  theory  r7;p«8l]).  In  this  case  P  coaq^letes  Its  first  stage  and 
selects  b^. 

Suppose  that  P  conpletes  the  n  stage  (n  x  l)  singling  out  b,...b  .  In 

X  n 

substage  B  (ae  1)  of  stage  n  'f  1,  P  determines  the  finite  set 

°B+l,m  ■  •  a  +  m,  Inlt  w  c  L,  and  'by .  .b^^  Is  In  Inlt  w}. 

If  ^  selects  proceeds  to  stage  n  +  2.  If  ^ 

Is  noneopty  and  every  word  In  It  has  the  saaie  n  +  1*^  letter,  say  then  P 

selects  *“*4  proceeds  to  stage  n  +  2.  Otherwise  P  proceeds  to  sub¬ 

stage  B  4  1.  If  L  contains  at  least  one  sequence  and  every  sequence  In  L  con¬ 
tains  then  there  Bust  be  sane  b  a  1  such  that  is  noneopty  and 

contains  only  words  with  b^^^  as  Its  n  4  1*^  letter.  In  this  case  P  eoiqpletes 
Its  a  4  1*^  stage  and  selects 

^^^For  a  word  w,  Inlt  w  ■  fu/u  ^  «,  uv  ■  w  for  Sam  v}.  For  a  set  B  of  words 

let  Inlt  H  -  U  Inlt  v. 
w  In  H 

s 

I 

f 

I 
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Suppose  that  L  contains  exactly  one  sequence  D.  Then  P  coopletes  the  n 
stage  for  every  n  and  enisnerates  D.  Since  P  is  an  effective  procedure,  D  Bust 
be  recursively  envaiterable.  But  a  sequence  is  recursively  enuserable  if  and  only 
if  it  is  recursive.  Hence  the  result. 

The  next  theorem  shows  the  existence  of  recursive  sequences  which  are  not 
distinguished.  Since  the  proof  involves  special  concepts,  it  is  given  in  the 
appendix. 

Theorem  2.5.  Let  a  be  a  given  element  of  S.  Oien  each  recursive,  non  u.p. 
sequence  D  with  the  property  that  for  every  n  >  1  there  is  a  word  ua  ,  u  ^  c, 
in  D  such  that  k  i  is  not  distinguished. 

In  passing,  we  mention  two  open  problems. 

(1)  Characterise  the  set  of  distinguished  sequences. 

(2)  Characterize  the  set  of  those  sequences  D  having  the  property  that 
there  exists  a  language  containing  D  but  no  u.p.  sequence. 

Section  3.  An  Equivalence  Condition 

We  now  show  that  the  solvability  of  a  language  containing  a  sequence  is 
equivalent  to  the  solvability  of  a  language  containing  a  special  kind  of  chain. 

(7) 

'One  such  sequence  D  -  Xj^...Xjj...  is  obtained  by  letting  f(0)  -  1,  f(n4l)  - 
f(n)  +  2^®'*’^^^^“^+  1  for  n  X  0,  x^  ■  b  if  i  is  in  the  range  of  f,  and  • 


otherwise. 


31  Januwry  1963 


15- 


01-738/001/00 


^eorem  3.1.  Cedi  C(P)  •  eountiag  chain,  where  P  is  a  subset  of  the  positive 
integers,  if  C(P)  is  the  set  of  those  words 


k 

w  ft  u  ft  •  •  •  D  ft  p 


k  >  1,  such  that  for  1  £  J  <  k,  ij  >  2  if  J  is  in  P  and  ij  ■  1  if  J  is  not  in 
P.  Ihen  the  question  of  whether  an  arbitrary  language  contains  a  sequence  is 
solvable  if  emd  only  if  the  question  of  whether  eui  arbitrary  language  contains 
a  counting  chain  is  solvable. 


Proof.  (1)  Let  a  be  the  operation  which  takes  each  occurrence  of  b  into 
fb,b^),  and  leaves  a  unchanged.  Ihat  is,  a(c)  ■  <  and  o(x^.  ..x^)  ■  a(x^). . .a(x^), 

O 

where  a(a)  ■  a  auad  o(b)  -  {b,b  ).  Let  t  be  the  operation  which  takes  each 
occurrence  of  a  into  ■  fba*^/n  a  l)  and  each  occurrence  of  b  into  Ag  ■ 

fba^'^Vo  *  Since  A^  and  Ag  are  languages,  o  and  t  preserve  languages,  l.e., 
if  L  is  a  language,  so  are  9(L}  and  t(L)  [1].  We  shall  show  that  for  a  subset 
L  of  9,  ot(L)  contains  a  counting  chain  if  and  only  if  L  contains  a  sequence. 
Therefore  if  it  is  solvable  whether  an  arbitrary  language  contains  a  counting 
chain,  then  it  is  solvable  whether  an  arbitrary  language  contains  a  sequence. 

To  this  end  let  L  be  a  subset  of  9.  Suppose  that  L  contains  a  sequence 
D  ■  .  .x^. . .  .  For  each  n  let  -  Xj^. .  .x^^.  Let  A^  ■  fl}  if  Xj^  ■  a  and  A^  ■ 

cp  if  x^  ■  b.  Let 

P  -  A^  fn/n  a  2,  -  x^^). 

i,  io  K 

For  each  n,  ba  ^...ba  is  in  T(d  )  if  cuod  only  if 

o 
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fj/j  <  n,  Ij  >  0  and  even}  ■  fj/j  £  n,  ■  a}. 

For  each  n  let  u  be  the  element  in  T(d  )  with  i.  -  1  or  2  and  1,.,  ■  1  or 

n  n  X  j+x  j 

^J+1  "  1  <  i  <  n.  Then 

k  iL  hi 

o(Ujj)  -  fb  ^a  H  ^...b  “a  “/kj  -  1,2;1  £  J  £  n). 

In  particular,  o(u^)  contains  an  elemnt  In  which  kj  ■  2,  1  £  J  £  n.  If  and 
only  if  J  la  In  P.  Ihus  k^  >  2  if  and  only  if  x^  ■  a.  Hence  k^  ■  1^.  Further* 
more,  for  each  J,  kj^^^  ■  2  If  and  only  if  Xj  >  *j+i>  whence  kj^^  ■  ^J+l" 

Thus  »  1  counting  chain  C(P).  Since  fv^^}  c  ^  j^)> 

a  1  E  D  c  L,  it  follows  that  C(P)  c  otCl). 


Now  sv^pose  that  the  counting  chain  C(P)  c  aT(L)  for  tom  set  P  of  positive 
Integers.  For  each  n  let 


i,  i,  i„  .\*i 
V  -  b  ^a  .  .b  “a’*“^ 


be  in  C(P).  Then  there  is  a  unique  word  d  in  L  such  that  v  is  in  orfd  ). 

n  n  A 

In  fact,  djj  is  the  word  Xj^****^  of  length  n  in  8(a,b)  for  which  (i)  Xj^  ■  a  if 
and  only  if  ij^  ■  2,  and  (ii)  for  1  s  J  <  n,  x^  -  Xj^^^  if  and  only  if  Ij^j^  ■  2. 

It  readily  follows  that  ^  is  a  sequence  in  L. 

(2)  He  shall  now  show  that  if  it  is  solvable  whether  an  arbitrary  language 
contains  a  sequence,  then  it  is  solvable  whether  an  arbitraxy 
a  counting  chain.  This  and  (1)  will  then  i^ply  Theorem  3.1. 


Let  be  the  saaw  as  in  Theorem  2.1.  Let  C(P)  be  an  arbitrary  counting 
chain.  If  u  is  a  word  in  C(P),  then  from  the  definition  of  counting  chain. 


r 

r 

i 

i 


\ 
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u  -  for  some  word  v  la  dCa^b).  ISiub  bu  is  in  for  each  word  u  in 

C(P).  Hence  bC(P)  c  L^. 

Let  M  be  a  given  language  over  S  ■  fa,b).  Let 

-  fb(0b)“a*(b4b^-rt>^a)a“'*'Vn  *  1^ 

Hg  -  ba*(b+b^4b^a)a  +  bMba  +  bMb^a, 
and  M  -  Eg. 

As  is  easily  seen,  Hg,  and  M  are  languages.  consists  of  all  words  of 
the  fora 

(o )  buba®b^a^,  wheire  t  ■  1  or  2,  u  is  in  9,  a  e  0,  and  2  <  p  <  #b(buba\^). 
Hg  consists  of  all  words  of  the  fora 

0)  baVa  or  ba“b^a®  for  t  >  1,  2,  and  a  a  0; 
and  (y)  bMb^a  for  t  ■  1,  2. 

00  Q  • 

Consider  the  language  >  b  +  b  -f  b'^-t-  L^(b-tb  )  +  M.  Ve  shall  show  that 

-contains  a  sequence  if  and  only  if  M  contains  a  counting  chain,  therefore 
if  it  is  solvable  whether  an  aurbitrary  language  contains  a  sequence,  it  is 
solvable  whether  an  arbitrary  language  contains  a  counting  chain. 

Siqn>ose  that  vb  is  an  initial  subword  of  a  counting  chain  C(P).  Ihen 
either  (l)  v  ■  c,  (il)  v  ■  b,  (ill)  v  is  in  C(P),  or  (iv)  v  ■  Vj^b  with  Vj^  in 
C(P).  Consider  bvb.  If  (i)  holds,  then  bvb  -  b^.  If  (ii)  holds,  then  bvb  ■ 
b^.  If  (iii)  holds,  then  bv  is  in  1^,  so  that  bvb  is  in  L^^b.  If  (iv)  holds. 


I 
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then  bv^  is  in  so  that  bvb  *  bv^b  is  in  L^b  .  Thus  if  vb  is  an  initial 
subword  of  a  counting  chain,  then  h  +  h'^+  Iij^(b+b  )  contrlbutea  bvb  to 

Consider  Initial  subvords  va  of  a  counting  chain.  It  is  clear  that  only 
M  can  contribute  words  ending  In  a  to  Suppose  that  va  Is  an  initial  sub> 
word  of  a  counting  chsdn.  Then  bva  >  bvb^a*^,  where  t  ■  1  or  2,  w  Is  a  word  not 
ending  In  b,  and  0  <  q  <  j|tb(bwb^).  If  w  contains  b  and  q  h  2,  then  bva  Is  in 
M  by  (or).  If  w  does  not  contain  b  cmd  either  qBlorqBtB2,  then  bva  is 
In  M  by  (0).  Thus  neither  (or)  nor  (8)  contribute  bva  to  N  If  and  only  If  bva  « 
bWj^bWgb^a  for  some  words  w^  and  Wg,  Wg  not  ending  in  b,  t  -  1  or  2. 

Suppose  that  M  contains  a  counting  chain  C(P).  By  the  previous  discussion, 
contains  bz  for  every  initial  subword  z  of  C(p)  except  possibly  z  -  w^bWgb^a, 
t  -  1  or  2,  Wg  a  word  ending  in  a.  In  this  case,  Wj^lWg  is  In  C(P),  thus  In  M. 

Hien  by  (y),  bz  Is  In  M,  thus  In  M.  Therefore  contains  bz  for  every  Inltiid 

subword  z  of  C(P).  Since  b  Is  silso  in  Mj^,  contains  every  Initial  subword  of 
bC(P).  Thus  contains  a  sequence. 

Now  suppose  that  contains  a  sequence  D.  It  is  easily  seen  that  D  begins 
with  b^.  Since  neither  b^  nor  b^  is  In  Lj^,  b*^  Is  not  in  Mj^.  Since  b^a^  is  not 
in  M,  b  a  is  not  in  M^.  Thus  D  begins  with  either  b‘ab  or  b'^a. 

(a)  Suppose  that  D  begins  with  b^ub^,  where  u  is  a  word  ending  in  a. 

Since  each  word  in  ends  in  a,  neither  b^ub  nor  b®ub^  is  in  I^.  Thus  b^ub^ 

is  not  in  M^,  so  that  D  begins  with  b^ub^a. 
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(b)  Suppose  that  D  begins  with  b^ub^a^,  where  t  *  1  or  2,  4  >  0,  and  u  is 
a  word  not  ending  in  b  (thus  u  sdght  be  c).  IVo  alternatives  arise. 

(1)  q  ■  #b(bub^).  Suppose  that  D  begins  with  b^ub^a^*^^.  Ihen 
q  +  1  a  3>  so  that  b^ub^a^'*'^  can  only  be  contributed  to  by  (or).  Since  q  1  ■ 
#b(b®ub^),  b^ub^a^'*’^  cannot  be  contributed  by  (a),  nius  D  begins  with  b^ub^aS). 

(11)  q  i  #b(bub^).  Since  D  is  a  sequence,  it  follows  frosi  (i)  that 
q  <  #b(bub^).  Suppose  that  D  begins  with  b^ub^B**b.  Then  b^ub^a^  is  in  L^,  so 
that  q  •  #b(bub^),  a  contradiction,  therefore  D  begins  with  b^ub^a**^^. 

From  (a)  and  (b)  and  the  fact  that  D  begins  with  b^ab  or  b^a,  it  follows  that 
D  consists  of  all  initial  subwords  of  bC(P)  for  some  counting  chain  C(P),  with 
ip  Ig,...  the  ejqponents  of  b's  in  the  chain. 


Finally,  6v;vpo8e  that  M  does  not  contain  C(P).  In  particular,  let 

k 

1  i  i 

X  -  b  a  .  .b  a'^ 

be  in  C(P)  but  not  M.  Consider  y  ■  bxb  ^*^a.  Cleiurly  y  is  not  contributed  to 
by  either  (a)  or  0).  Since  x  is  not  in  M,  y  is  not  contributed  to  by 
(v).  Thus  y  is  not  in  N^,  contradicting  the  assuaptlon  that  contains  D. 

Thus  H  contains  C(p),  l.e.,  M  contains  a  counting  chain  if  and  only  if  con¬ 
tains  a  sequence.  Q.E.D. 


1 

j 

5 


( 


Resaufk.  While  the  problea  of  whether  an  arbitrary  language  contains  a  counting 
chain  is,  as  yet,  unresolved,  the  problea  of  whether  an  arbitrary  language  con¬ 
tains  aU  counting  chains  is  recursively  unsolvable.  For  let  £  >  {a,b}  and  let 
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o  T  be  M  la  Thaorea  3«1*  Tor  each  language  L,  it  ie  readily  aeen  tliat 
ot(L)  containa  all  counting  ctaaina  if  and  only  If  0(a,b)-  «  c  L.  Since  L  ■ 
9(a«b)  la  recursively  unsolvable  [1],  It  foUova  that  vbetlier  0(a,b)-  c  c  L  is 
recursively  unsolvable.  Ibus  whether  ot(l),  hence  an  arbitrary  language^  con- 
tians  all  counting  chains  is  recursively  unsolvable. 

Section  U.  Special  Sequences 

In  this  section  we  consider  whether  a  language  L  contains  a  specific 
sequence  is  solvable.  Ve  exhibit  one  set  of  sequences  for  which  it  is  unsolv¬ 
able  and  another  for  which  it  is  solvable.  We  then  show  that  whether  a  language 
contains  a  sequence  of  a  special,  fom  is  unsolvable. 

Theorem  U.l.  Given  a  sequence  D  and  language  L,  it  is  recursively  unsolvable 
whether  D  c  L. 

Proof.  Let  F  be  the  set  of  all  sequences  of  the  fom 
Init  (cWj^cWgCWj. . .  ), 

where  U  w.  >  6(a,b).  Given  a  language  M,  consider  L(m}  >  Init 
i-1  ^ 

If  M  a  9(s,b),  then  D  c  L(M)  for  every  D  in  F.  If  M  ^  0(a,b),  then  D  c  L  for 
no  D  in  F.  Since  it  is  recursively  unsolvable  whether  M  >  0(a,b)  for  an  arbi¬ 
trary  language  M  fl],  it  is  recursively  unsolvable  whether  D  C  L  for  som 
specific  (or  even  one)  D  in  F. 

^^^If  L  is  a  language,  then  Init  L  is  a  language  [3]. 
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Ve  next  shov  that  it  is  solvable  Aether  a  language  contains  a  given  u.p. 
sequence. 

Tiwmia  U.l.  Given  vorda  v^>  Vy  it  is  recursively  solvable  whether  an  arbl- 
trary  language  L  contains  v^v^v^. 

Proof.  It  m  t,  then  it  is  recursively  solvable  whether  v^w^  is  in  L. 

#  # 

Suppose  that  s.  Since  w^WgW^  is  regular  and  L  is  a  language,  A  ■  w^w^w^  L 

is  a  language  and  effectively  calculable  froa  L  [1].  Since  w^WgW^  c  L  if  an& 

s  * 

only  if  A  a  w^WgV^,  it  suffices  to  show  that  whether  A  and  v^w^w^  are  equal  is 

solvable.  Let  (t^)  be  the  operation  which  mps  a  word  x  into  x^^  if  x  a 

Wj^Xj^  (x  a  x^Wj)  and  into  cp  otherwise.  Bien  A  and  w^WgW^  are  equal  if  amd  only 
if  ■  '^2*  ‘’■2'’’!^^)  *  language  and  effectively  calculable  frost 

A  [5].  Since  w^  «,  Wg  a  y^^. ..y^,  y^^  in  E.  Consider  the  generalized  sequential 
■achine  S  a  (K,E,fa},6,X,pj^),  where  K  -  fp^,. . .  ,Pyl,  X(Pj,y)  a  e  for  i  r, 
X(p,*y)  ■  A,  6(Pi,y)  -  Pj^j^  for  1  <  r,  and  6(p^,y)  a  p^,  y  m  S.  Then 

SCt2Ti(a)]  a  faVvg  in  TgTj^(A)}.  Since  TgTj,(A)  is  a  language,  srTgTj^(A)]  is  a 

language  and  effectively  calculable  from  t^t^^Ca)  ani  S  [5].  Prom  [U],  a 
language  on  one  letter  is  a  regular  set  and  is  effectively  calculable  as  a 
regular  set.  But  A  a  WjWgW^  if  and  only  if  sCtgij^CA)]  a  a*.  How  it  is  solv¬ 
able  whether  two  regular  sets  are  equal  [9].  Thus  it  is  solvable  whether 
SCTgTj^(A)]  a  a  .  Hence  the  result. 

Theorem  U.2.  Given  an  u.p.  sequence  D  and  a  language  L,  it  is  solvable  idiether 
L  contains  D. 

i 


f 
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Proof.  Let  D  be  an  u.p.  8eq,uence.  Then  D  ■  Inlt  (w(aj^.  ..ap)  ),  a^  In  Z,  for 
some  word  v  and  some  p  >  1.  For  any  language  L«  D  c  L  if  and  only  If  L  contains 
each  of  the  following  p  +  1  sets:  Init  v,  w(aj^. ..a^)  ,  va^Cag. . .^a^)  , 
va^^a^Ca^.  •  .a^a^^a^)  > . . .  jva^. .  *  Inclusion  of  Init  v  is  solv¬ 

able  because  Init  v  is  finite  and  a  language  is  a  recursive  set.  Each  of  the 
other  inclusions  is  solvable  by  4.1.  nxus  whether  L  contains  D  is 

solvable. 

We  have  Just  seen  that  whether  a  language  contains  a  specific  u.p.  seq.ueace 
is  solvable.  The  question  arises  as  to  whether  a  language  contains  at  least 
one  u.p.  sequence  with  a  given  period  is  solvable.  Ve  new  show  that  it  is  not. 

Theorem  4.3.  Given  a  language  L  and  non-<  word  w,  it  is  recursively  unsolvable 
whether  L  contains  an  u.p.  sequence  with  period  w,  that  is,  a  sequence  of  the 
form  Init  (w^w*). 

Proof.  Let .a,  b,  and  c  be  three  letters  not  occurring  in  w.  For  each  positive 
integer  J  denote  by  J  the  word  ab*^.  For  each  n- tuple  z  »  (Zj^, . . .  ,8^^)  of  words 
in  6(a,b)-c  let 

L(z)  •  fzj^  ...Zj^  c  ij^. ..ij^/k  »  1,  each  Ij  s  n]. 

Let  A(w)  ■  e  +  Init  w  -  w.  For  any  two  n-tuples  x  ■  (x^,...Xj^)  suid  y  ■ 
of  words  in  0(a,b)-s  let 

L(**y)  ■  lolt  U*)  +  LCx)  w(w^)*A(w)  +  L(y)  (w^)*A(w). 

Suppose  that  L(x)  D  L(y)  ^  cp.  Let  w^  be  an  element  in  L(x)  D  L(y).  Then 
Init  (w^w  )  c  L(x,y),  so  that  L(x,y)  contains  an  u.p.  sequence  with  period  w. 
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Suppose  that  L(x,y)  contains  an  u.p.  sequence  with  period  v.  nxen  there 
is  a  vord  such  that  Init  (v^v*)  c  L(x,y).  Zn  particular,  v^v  and  are 

in  L(x,y).  Since  L(x)  c  0(a,b,c)  and  v  is  not  in  dCa^h^c),  neither  v^v  nor 

2  2  2^ 
v^v  is  in  Init  L(x).  Stqipose  that  vr^v  and  v^v  both  are  In  L(x)v(w  )  A(v). 

2  2 

Since  a,  b,  and  c  do  not  occur  in  v,  v^v  ■  uvw  and  w^v  «  uw  ,  with  u  in 
L(x)  and  w,  vw^  in  w(w^)*A(w).  How  for  each  word  t  in  w(w^)*A(w) 

(♦)  It]  -  (2r^+  1)  Iwl  +  s^, 

where  r.  is  a  nonnegative  integer,  and  0  <  s.  x  |w|-  1.  Thus  |vw|  >  (2r  +  l) 

w  w  W 

|wl  +  8^.  Then 

Ivw^l  •  Ivwj  +  |wl 

■  ^^*'vw'*’  ''''  •vw'^  ' 

Since  0  <  s^  <  |w|-  1,  jvw^l  is  not  of  the  fona  given  in  (*).  Therefore  one 

O  9  # 

of  the  words  w^w  or  w^w  is  not  in  L(x)w(w  )  A(w).  Similarly  one  of  the  words 
WjW  or  WjW  is  not  in  L(y)(w  )  A(w).  Thus  w^w  is  in  one  of  the  two  sets, 
L(x)w(w^)  A(w)  or  L(y)(w^)*A(w),  and  WjW^  is  in  the  other.  Therefore  w^^  ■ 

Ul^l  -  “1  \  ^  Vg  in  Init  (w*)  +  e.  Since 

Uj^Ug  and  Vj^Vg  have  no  coaaon  letters,  u^  -  Ug  and  u^  is  in  L(x)  0  L(y).  Thus 
L(x)  n  L(y)  cp. 

It  thus  follows  that  a  necessary  and  sufficient  condition  that  L(x,y) 
contain  an  u.p.  sequence  with  period  w  is  that  L(x)  D  L(y)  4  9.  But 
L(x)  n  L(y)  4  cp  occurs  if  and  only  if  there  exists  a  sequence  of  integers 

tee  80  e  e  e  ^ 


Since  the  latter  is  the  Poet 
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Correspondence  Problem  and  is  knovn  to  be  recursively  unsolvable  [8],  whether 
l(x,y)  contains  an  u.p.  sequence  of  period  v  is  recursively  unsolvable. 

Remark.  The  family  of  languages  L(x)  of  Theorem  4.3  is  also  useful  in  showing 
the  following  questions  about  sequences  and  an  arbitrary  language  L.  to  be 
recursively  unsolvable: 

(1)  Whether  L  contains  a  sequence  containing  a  fixed  letter.  For  let  d 
be  a  letter  not  occurring  in  L(x)  +  L(y)  and  consider  [init  L(x)](d^)*+ 

[Init  L(y)]d(d^)*. 

(2)  Whether  L  contains  a  purely  periodic  sequence. For  let  d  be  a 
letter  not  occurring  in  L(x)  +  L(y)  and  consider  Init  fdLCxXdLCy))*]. 

We  conclude  with  a  remark  on  the  unsolvablllty  of  whether  a  language  con¬ 
tains  an  u.p.  sequence.  Let  D,  and  be  as  in  Theorem  2.1.  It  follcws 

from  the  proof  of  Theorem  2.1  that,  for  any  word  w  in  9(a,b),  Init  w  c  if 
and  only  if  w  is  in  D.  Let  t  be  the  operation  such  that  for  each  word  u,  t(u)  ■ 
u  if  the  letter  c  occurs  in  u,  and  t(u)  -  tp  otherwise.  If  L  is  a  language,  then 
t(L)  Is  a  language  C^].  Thus  T(lnit  L(x)}  is  a  language.  Consider  the 

M(x,y)  -  1^+  L(x)(d®)*+  L(y)d(d^)*+  T(lnit  L(x)). 

Then  M(x,y)  contains  an  u.p.  sequence  if  and  only  if  there  exist  integers 

^  •••*<  ■  y*  •••y*  this  word  is  in  D. 

a  J  ^1  ^k  ^1  ^k 

(9) 

'"A  sequence  of  words  D  -  x^.  is  said  to  be  purely  periodic  if  there 

exists  an  integer  m  a  1  so  that  x. 


for  all  i  a  1. 
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Zn  other  vordSj  vhether  a  language  contains  an  u.p.  sequence  Is  recursively 
unsolvable  if  the  foUoving  modification  of  the  Post  Correspondence  Problem  is 
true:  Given  tvo  n-tuples  and  (yj^>..<>yQ)  of  vords  in  9(a,b)-e,  it 

is  recursively  unsolvable  vhether  there  is  a  sequence  of  integers  i^, ...,i^. 


each  i.  <  n^  so  that  x.  ...x.  *  y.  ...y. 

**  ^1 

p  O  iW  i  * 

bbaba^*^. . .  e 


and  X .  . . . X . 


begins  the  sequence 
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APPENDIX 

Proof.  We  first  recall  sobm  teminology  and  facts  about  generation  tx«es.  Let 
G  -  (V,P,SjS)  be  a  gramnar.  Call  the  eleneats  of  V-E  variables.  Let  be  a 
variable.  Let  Vg, ...,w^  be  words  in  6(v),  **  *2  production,  with  the  foUov- 

Ing  property.  For  2  <  i  <  r  there  exist  words  u^,  v^,  such  that  w^  ■ 

'^i+l  ■  “i®i^i*  ^1  ■*  *1  production.  A  generation  tree  (con¬ 

structed  below)  is  a  rooted,  directed  tree  with  an  element  of  V  U(c},  called 
the  node  name,  associated  at  each  node. 

Die  nodes  of  the  tree  are  certain  tuples  of  the  fom  (i^, ...,i^),  where 
k  £  r  and  Ij  is  a  positive  integer,  n^e  directed  lines  of  the  tree  are  all  the 
ordered  pairs  (lp...,ij^,  of  nodes.  Let  the  l-ti^le  (1) 

be  the  root  and  w^  the  node  name  of  (l).  If  w^  >  c  let  (1,1)  be  a  node  in  the 
tree  and  c  the  node  name  of  (1,1).  If  Wg  -  • -*1(2)* 

(l,i),  1  <  1  <  n(2),  be  a  node  amd  its  node  name.  Continuing  by  induction, 
suppose  that  for  all  t  <  k,  every  occurrence  in  w.  of  an  eleannt  of  V  serves  as 
node  nam  of  some  node.  Now 

Vk^-^-'^k+i- Wk* 

let  (i^, ...,i^)  be  the  node  whose  node  nane  is  the  occurrence  of  y^  indicated 

in  (*).  If  ■  «  let  (lp...,i^,l)  be  a  node  suid  c  its  node  naae.  if  ■ 

(k)  fk)  fk^ 

^  each  in  V,  let  (i^,...,i^,l),  1  c  i  k  n(k),  be  a  node  and 

(k) 

'  its  node  name.  Ihls  procedure  ia  repeated  through  k  ■  r-1.  The  resulting 


entity  is  the  generation  tree. 


4 
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A  node  iB  said  to  be  an  extension  of  the  node  (lj^,...,ig)  if 

•  a  t  and  for  all  k  a  s. 

A  path  In  a  generation  tree  is  a  sequence  of  nodes  such  that 

is  a  directed  line  for  each  1  a  k-1. 

Given  the  nodes  ■  (ij^,...>lg)  and  Hg  -  write  a  if 

either  Ng  is  an  extension  of  or  if  ij^  <  for  the  smallest  integer  k  such 
that  ij^  ^ 

nie  relation  a  is  a  simple  order  on  the  set  of  nodes. 

A  node  is  ccdled  maximal  if  there  is  no  node  distinct  froa  it  which  is  an 
extension  of  it. 

We  shall  use  (implicitly  and  explicitly)  the  following  known  facts  about 
a  generation  tree  T  associated  with  C  *  w  fl]: 

(a)  If  H  is  a  nonmaxlmal  node,  then  the  node  name  x  of  H  is  a  variable 
and  S  •  uxv  for  sosw  u  and  v  in  0(V). 

(b)  Let  ...,>1^  be  the  maximal  nodes,  with  "1  ‘  "i+l  for  each  i.  Then 
w  is  the  word  obtained  by  replacing  in  N^. .  .Hj^  each  node  with  its  node  name. 

(c)  Let  II  be  a  nonmaxlmal  node  in  a  generation  tree,  and  x  its  node  nasw. 
Then  the  "subtree"  of  T  fonsed  by  using  as  nodes  all  extensions  of  If  is  a 
generation  tree. 

(d)  Let  w  ■  uyv  and  let  be  a  generation  tree  of  y  *  w^^.  If  T^  is 
placed  (in  the  obvious  way)  with  its  root  on  the  node  idiose  node  name  is  y  in 
uyv,  then  a  generation  tree  of  {  *  uw^v  is  obtained. 
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We  now  return  to  the  proof  of  nieorem  2,3  •  Lat  D  he  a  eequeiuse  aatiafying 
the  hypothesis  of  the  theorem.  Let  L  be  any  language  containing  D.  Ve  shall 
show  that  D  contains  an  u.p.  seq,uance.  Consider  the  set  L'  ■  L-fs}-£.  L'  Is  a 
language  and  there  is  a  gramar  G  -  (V,P,r,S),  L(g)  ■  L* ,  such  that  every  pro¬ 
duction  In  P  Is  of  the  form  ^  ^  and  v  In  V  [!]•  H  denote  the  number 

of  distinct  variables.  Let  B  be  the  set  of  those  variables  (  such  that  {  •• 
a*Sa^  for  some  s  +  t  >  0.  Let  be  the  set  of  those  $  In  E  such  that  $  ^  ^a^ 
for  sane  t  >  0.  fVe  can  effectively  determine  H  and  E^,  but  ve  do  not  need 
this  fact.]  Ve  shall  see  below  that  E  Is  nonempty.  Denote  the  distinct  elements 
of  B  by  For  each  la  Bj^  let  e(i)  >  0  be  an  integer  such  that 

For  each  5^  in  E-Bj  let  e(l)  >  0,  s(i),  t(l)  be  Integers  such 
that  e(i)  ■  s(i)  +  t(i)  and  5^  Let  e  ■  e(l)...e(r). 

Consider  any  word  ua^  in  D,  vhere  u  c  emd  k  »  We  shall  show 

that  Inlt  (ua  )  c  L,  thereby  proving  the  theorem.  Since  Inlt  (ua  )  c  D  c  L, 
it  suffices  to  show  that  ua^  is  In  L(g)  for  each  q  >  k.  Accordingly,  let  q  >  k 
be  given  and  let  p  -  |u|.  Ihea  k-2^  »  2(21l+e)p_g2EP  .  2^(2®^-l)  »  2(2*®-l) 
a  2*^  >  ep  a  e.  Therefore  there  is  a  positive  integer  g  such  that  2^®^  < 
q-ge  «  k.  Then  ua^”®*  is  in  Inlt  (ua^)  and  lua*^’®*]  a  2.  Thus  ua^“®*  is  in 
L(g).  Bence  there  is  a  generation  tree  T  of  g  vhlch  derives  ua*^~®*  (from  S). 

Since  each  production  is  of  the  form  ?  ^v,  u  and  v  in  V,  it  is  readily  seen 

that  euiy  generation  tree  of  G  of  a  word  of  length  >  2°  contains  a  path  with  at 
least  nt-1  nodes,  vhere  each  node  naan  is  a  vsuriable.  Nov  |ua^~®*|  >  q-ge  >  2^^a 
2**^^'*’^^  Thus  T  contains  a  path  2j^»***  vhere  the  node  nasie  of  each 
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is  a  variable.  Since  there  are  only  H  dietlnct  variables,  one  of  tbea,  say 
is  the  node  aeae  of  at  least  p  t  2  nodes.  Denote  by  Yj^,. . . first 

p  4-  2  nodes  in  the  path  whose  node  naaie  is  (.  For  1  <  i  i£  p42,  let  be  the 

subtree  of  T  whose  nodes  are  the  extensions  of  Y^.  Then  is  a  generation  tree 
(frosk  of  a  word  in  d(Y)-c.  For  1  <  i  <  p+1,  since  the  zujde  Y^^^  occurs 
in  there  are  words  Xj^,  y^^  in  9(i:)  such  that  5  •  *1?^! 

Since  each  production  is  of  the  form  y  -•  uv,  u  and  v  in  V,  x^y^?^  *•  Since  Y^ 

is  in  T,  there  exist  Wj^,  Wg  in  8(v)  such  that  8  •  ua*^”®®  ■ 

''i*!'  *  *  ^+i^p+2yp+i'  •  •  ^a'2' 


Two  cases  arise. 


(1)  Suppose  that  one  of  the  x^  is  c.  Let  J  be  the  smallest  integer  such 
that  Xj  ■  c.  Ihen  |xj^. . .Xj_j^|  a  J-1.  Since  e  for  each  i,  ^*j+i****p+i 

^p42^p+l*  *  *^J+1^  *  P+1“J*  ^*l*"^j+ll  *  u  is  an  initial  sub- 

word  of  Therefore  y^  is  in  a  .  As  Xj  ■  s,  yj^  *.  Thus  yj  is  in 

# 

aa  .  Since  5  ®*  ®*y  ?  “  5^*  e  is  a  multiple  of  e(d). 

Thus  5  •  .  ?a®®  and 


S  -  Wj^Xj^...Xj?yj...yjWg 

-w^x^...*j?a8*yj...y^Wg 

*  '^l*!'  •  •  Vl^IH2^P+l* '  ’  •  •  •  ^l''2 

-  ua'l-®*a8*  -  ua^. 


(2)  Suppose  that  none  of  the  is  s.  Then  |xj^...Jtpl  >  p,  so  that  u  is 
aa  initial  subword  of  Wj^Xj^...j^.  Thus  •  •  yi»2  ^® 
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5  **  *ip+l^P+l  ^  *■*  H,  say  S  -  5^. 

there  exist  oonnegative  Integers  s  and  t  so  that  S  **  a*Sa^  and  e(d)  ■  s 
Thus 

•  w  x,...x  V  ^  vw 

WiXj^. . .  XpS  *p+l  P+2^P+1  ^P'  "  ^12 

(8+t)ge/e(d) 


''iV-*ViWp+i'--W' 

.l«c 


is  in  aa 


a 

■  ua^*. 


Then 
+  t. 
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